WATERMARK EMBEDDING AND DETECTING METHOD BY A QUANTIZATION
OF A CHARACTERISTIC VALUE OF A SIGNAL 



CROSS-REFERENCE TO RELATED APPLICATIONS

The present patent application claims priority from
Korean Patent Application No. 2003-0021827, filed on April 8,
2003.

BACKGROUND OF THE INVENTION

Field of the Invention

The present invention relates to a watermarking method
and device, and more particularly to a watermarking method
capable of integrity authentication by identifying
forgery/alteration of digital audio signals, and a device
therefor.

Description of the Related Art 

Watermarking technology is used in various fields such
as broadcast monitoring, owner identification,
authentication, fingerprinting for tracing illegal
circulation, covert communication, copy control, etc. The
requirements for watermarking technology may differ for each
of these fields; however, in all of them, the difference
between the original data and the watermarked data should not
be perceptible to the human senses.

Among these various fields, authentication is one that
has recently been gaining attention. Systematic research on
authentication has long been performed in the field of
encryption. The first person to raise the problem of
authentication in the field of watermarking was Friedman
(U.S. Patent No. 5,499,294). He proposed that digital images
can be authenticated by embedding an encrypted signature,
extracted as the characteristic value of an image, into the
image data. In such a case, if even a single pixel is
changed, the signature obtained from the image no longer
corresponds to the embedded encrypted signature. Thus, no
manipulation is allowed. In addition, Lin & Chang proposed an
authentication method wherein the embedded data are not
changed by harmless data manipulation such as JPEG
compression, whereas the embedded data are changed by other
attacks such as the addition, deletion or alteration of a
part of the data.

Among the various applications of watermarking, the
present invention focuses on authentication. Previously
developed authentication technology was mainly directed to
image and video, and there was almost no authentication
technology for voice signals and audio signals. Recently, as
voice recording devices change from analogue to digital,
authentication of audio signals is being required. Thus,
along with the development of digital voice recording devices
such as voice recorders and MP3 players, the necessity of
authentication is increasing.

SUMMARY OF THE INVENTION

Technology for identifying forgery/alteration of audio
signals should provide a function which can sense whether the
original content has been manipulated when a part of the
recorded audio signal data has been changed, when any data is
added to the audio signal data, or when a part of the audio
signal data is deleted. Further, it should also provide
information for understanding the original meaning by
inferring the location of the forgery/alteration and the form
of manipulation.

The technical characteristics required to attain the
above object include inaudibility of the embedded
watermarking data, robustness against compression, tamper
resistance for preventing the exposure of the watermarking
technology, and reliability in embedding and extracting
various patterns, etc. Further, on the premise that the
technology should be inserted as a module into ordinary
household appliances, quick processing is required so that
real-time processing in hardware is possible, and only a
limited amount of memory is to be used.

An object of the present invention is to provide a
method for watermarking data which can satisfy the above
requirements, and a device therefor, and more particularly to
provide a watermarking method appropriate for preventing and
detecting forgery/alteration of audio signals, and a device
therefor.

In order to attain the above object, the method for
watermarking in accordance with the present invention
comprises steps of evaluating a characteristic value,
determining a quantized value, evaluating an intensity of
insertion, and modifying the signal.

In the step of evaluating a characteristic value, a
characteristic value for a signal within a frame, obtained by
segmenting the signal to be watermarked into predetermined
time periods, is evaluated in a predetermined manner.

In the step of determining a quantized value, the
quantized value closest to the characteristic value is
determined by comparing the characteristic value with the
quantized values within one set among a plurality of sets,
each set including one or more quantized values, the set
corresponding to a value of pattern information to be
embedded into the frame.

In the step of evaluating an intensity of insertion, an
intensity of insertion, used to modify the signal within the
frame so that the characteristic value becomes the same as
the quantized value, is evaluated.

In the step of modifying the signal, the signal is
modified within the frame based on the intensity of
insertion.

The method may further comprise a step of filtering the
signal through a predetermined range of frequency before the
step of evaluating a characteristic value, wherein the
characteristic value for the filtered signal is evaluated.

Also, the method may further comprise a step of
detecting a silent part within the signal, wherein the steps
from evaluating the characteristic value to modifying the
signal may be performed only for frames including the signal
other than the silent part.

It is preferable for the pattern information embedded
as a watermark to include an error detecting code or an error
correcting code, and a synchronizing signal.

The pattern information may consist of one bit for
each frame, or a plurality of bits for each frame. The method
for inserting a plurality of bits into each frame may
comprise a step of filtering the signal through a plurality
of frequency ranges, each with a different band, before the
step of evaluating the characteristic value, wherein the
plurality of bits are inserted respectively into the signals
filtered through the plurality of frequency ranges.

The method for detecting a watermark in accordance
with the present invention comprises steps of evaluating a
characteristic value, determining a quantized value, and
extracting pattern information.

In the step of evaluating a characteristic value, a
characteristic value for the signal within a frame, obtained
by segmenting the signal into predetermined time periods, is
evaluated in the same manner as in the step of evaluating the
characteristic value in embedding the watermark.

In the step of determining a quantized value, the
quantized value closest to the characteristic value is
determined by comparing the characteristic value with each
quantized value within the plurality of sets of quantized
values used for the quantization of the characteristic value
in embedding the watermark.

In the step of extracting pattern information, the value
corresponding to the set of quantized values that contains
the quantized value determined in the determining step is
extracted as the pattern information inserted into the frame.
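
As a rough illustration of the three detection steps, the following Python sketch decides the embedded bit from a characteristic value. It assumes the example sets Q_0 and Q_1 given in [Equation 3] of the detailed description; the function name `extract_bit` and the rule of returning 0 on an exact tie are assumptions of this sketch, not part of the described method.

```python
# Example quantization sets taken from [Equation 3].
Q0 = [-0.7, -0.3, 0.1, 0.5, 0.9]   # levels that encode bit "0"
Q1 = [-0.9, -0.5, -0.1, 0.3, 0.7]  # levels that encode bit "1"


def extract_bit(f):
    """Return the bit whose set contains the level closest to f."""
    d0 = min(abs(q - f) for q in Q0)   # distance to nearest "0" level
    d1 = min(abs(q - f) for q in Q1)   # distance to nearest "1" level
    return 0 if d0 <= d1 else 1        # tie goes to 0 (sketch assumption)


print(extract_bit(0.12))   # nearest level is 0.1 in Q0, so the bit is 0
print(extract_bit(0.29))   # nearest level is 0.3 in Q1, so the bit is 1
```

Because the two sets interleave with a spacing of 0.2, a characteristic value that drifts by less than 0.1 after mild processing still decodes to the embedded bit.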

If the signal was filtered when the watermark was
embedded, it is preferable for the signal to be filtered in
the same way when the watermark is extracted. If the signal
was filtered through a plurality of frequency ranges when the
watermark was embedded, so that a plurality of bits were
respectively embedded as pattern information into each
frequency range, it should also be filtered through the same
plurality of frequency ranges when extracted, so that pattern
information is extracted from each frequency range.

In accordance with the present invention, a method for
embedding and detecting a watermark that is appropriate and
reliable for authenticating audio signals, by quantizing the
characteristic value of the signal, is provided.

The present invention proposes an audio watermarking
technology which uses the quantization of an audio
characteristic value. According to the experimental results,
the proposed technology is very robust against various kinds
of lossy compression, and an original song and a watermarked
one were almost indistinguishable. With the proposed
technology, the detection rate for audio signals which have
gone through regular compression is higher than 88%, and the
detection rate for all other attacks except pitch shift is
higher than 80%. The SNR, which was used as the standard for
evaluating the quality of sound, is about 49-64 dB, so the
watermarked sound stays at almost the same level as the
original sound. This shows that even experts could not easily
distinguish the original song from the watermarked song.

In accordance with the present invention, reliability
is given to voice data stored in a digital format. Data
stored in a digital format by hardware products such as
digital cameras or digital voice recorders can be manipulated
and altered, and thus do not have legal force and cannot be
used as evidence. If the present invention is realized in
hardware operating in ordinary household appliances, the
presence of forgery/alteration of an audio signal stored in a
digital format can be sensed. Therefore, integrity
authentication of content becomes possible for digital data
which were not reliable because they could be easily
manipulated. In particular, as MP3 players, telephone
counseling services, voice recorders, etc. are becoming
widely used, the amount of data stored in a digital format is
increasing geometrically, and thus the present invention will
be widely applicable.

BRIEF DESCRIPTION OF THE DRAWINGS

Fig. 1 is a block diagram of the watermarking system
performing the method for embedding and detecting a watermark
in accordance with the present invention;

Fig. 2 is a flow chart of the method for embedding a
watermark in accordance with the present invention;

Fig. 3 is a drawing illustrating the frame of the
signal to be watermarked;

Fig. 4 is a flow chart of the method for detecting a
watermark in accordance with the present invention; and

Fig. 5 is a drawing illustrating the mutual
relationship between a detected characteristic value and a
quantized value.

DETAILED DESCRIPTION OF THE INVENTION

Preferred embodiments of the present invention will be
described in detail below with reference to the drawings.

The present invention proposes a technology for
integrity authentication of the content of voice signals,
based on watermarking technology using the group quantization
of an audio characteristic value. The embedment and
extraction of a watermark in accordance with the present
invention are performed by a watermarking device having the
basic configuration shown in Fig. 1.

Referring to Fig. 1, the watermarking device largely
comprises an embedding part 100 for embedding a watermark
and an extracting part 200 for identifying
forgery/alteration.

The embedding part 100 comprises an audio signal input
device 110, a watermark pre-detecting part 120 and a
watermark embedding part 130. The digital audio signal (PCM
data) outputted by the audio signal input device 110 is
inputted into the watermark pre-detecting part 120. The
watermark pre-detecting part 120 verifies whether the
inputted audio signal already has a watermark embedded in it.
If the watermark pre-detecting part 120 determines that no
watermark is embedded in the audio signal, the audio signal
is transmitted to the watermark embedding part 130. The
watermark embedding part 130 inserts the pattern signal as a
watermark into the audio signal. The audio signal in which
the watermark is embedded is stored in a storage media 150 of
the digital voice recording device.

The extracting part 200 comprises a watermark
extracting part 210, a watermark pattern pre-processing part
230 and a forgery/alteration area detecting part 240. The
watermark extracting part 210 extracts the watermark embedded
in the audio signal inputted from the storage media 150 to
obtain a pattern signal. The watermark pattern pre-processing
part 230 purifies the pattern signal, which is distorted by
incorrect extraction during the watermark extracting process,
by removing the noise included in the extracted pattern
signal. The forgery/alteration area detecting part 240
obtains detailed information on the presence of
forgery/alteration, the location of manipulation, etc. by
using the purified pattern signal.

The embedment and extraction of a watermark in
accordance with the present invention are performed by the
above watermarking system, and are described below.

Semi-fragile watermarking technology is generally used
for watermarking intended to determine the presence of
forgery/alteration. This is because the watermark is not
removed by normal acts of storing data, such as transforming
the file format or compressing files, whereas it is removed
by cropping, adding a signal, or attacks that largely affect
the quality of sound, etc. Thus, when the voice signal
differs from what was originally intended to be transmitted,
forgery/alteration can be determined.

For image data, a method is used which comprises steps
of segmenting the full image into small blocks, extracting
the characteristic value for each block, and storing the
characteristic value in the corresponding block using
watermarking technology. If the image in part of an area is
replaced with another image or removed, no watermark
information corresponding to the characteristic value can be
detected, and thus forgery/alteration can be determined.

Similarly, the present invention is based on a method
in which audio signals are segmented into frames of a
predetermined size and the bit string of pattern information
generated in advance as a watermark is inserted into the
frames in a regular sequence, in order to identify
forgery/alteration of the audio data.

Fig. 2 is a flow chart of the method for embedding 
watermark in accordance with the present invention. 

The method for embedding a watermark in accordance with
the present invention comprises the steps of band-pass
filtering the audio signal (S110), evaluating a
characteristic value of the filtered audio signal (S120),
determining the level of quantization for the evaluated
characteristic value (S130), evaluating the intensity of
insertion corresponding to the determined level of
quantization (S140), embedding the watermark using the
evaluated intensity of insertion (S150), and recording the
audio signal in which the watermark is inserted on a storage
media (S160).

Each step of the procedure of embedding a watermark
can be subdivided and described in detail as follows.

<Step 1>: Step S110

Frame segmenting and band-pass filtering

Fig. 3 is a drawing illustrating the frame of the
audio signal. As shown in Fig. 3, the audio signal is
segmented into frames of a predetermined length (..., F_{i-1},
F_i, F_{i+1}, ...). The length of each frame should be shorter
than 100 ms. Each frame (F_i) is divided into two areas of the
same length, named "range A" and "range B", respectively.

After the audio signal is segmented into frames of a
predetermined size, the audio signal of each frame (F_i) is
band-pass filtered. Band-pass filtering serves to extract a
reliable characteristic value of the audio signal. A band
between approximately 2 kHz and 4 kHz is used, and a
band-width of at least 1 kHz is the most appropriate.
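
The framing described above can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the implementation of the invention: the function name `split_frames`, the default of 80 ms (the frame size quoted for the experiment later in the text), and the choice to drop trailing samples that do not fill a frame are all assumptions. Band-pass filtering is left out because it would need a DSP library; in practice the signal would be filtered to the chosen band before the halves are used.

```python
def split_frames(samples, sample_rate, frame_ms=80):
    """Cut a list of PCM samples into frames and halve each frame.

    Returns a list of (range_a, range_b) tuples of equal length.
    """
    if not 0 < frame_ms < 100:
        raise ValueError("frame length should be shorter than 100 ms")
    frame_len = int(sample_rate * frame_ms / 1000)
    frame_len -= frame_len % 2              # even length so A and B match
    if frame_len == 0:
        raise ValueError("sample rate too low for this frame length")
    half = frame_len // 2
    frames = []
    # Trailing samples that do not fill a whole frame are ignored.
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        frames.append((frame[:half], frame[half:]))   # (range A, range B)
    return frames


# 2000 samples at 8 kHz give three complete 80 ms frames of 640 samples.
frames = split_frames(list(range(2000)), sample_rate=8000)
print(len(frames), len(frames[0][0]), len(frames[0][1]))   # 3 320 320
```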

The pattern information of the watermark to be
embedded is predetermined, and one bit of the pattern
information is inserted into one frame (F_i) of the segmented
audio signal. For example, if the pattern information of the
watermark to be embedded comprises {1, 0, 0, 0, ..., 1, 1},
a value corresponding to "1" is inserted into the first
frame of the audio signal, and a value corresponding to "0"
is inserted into the second frame. After the whole bit string
of the pattern information has been inserted into successive
frames in this sequence, the same bit string is inserted
again, starting from the next frame.

As above, the pattern information is a bit string of
"1"s and "0"s whose length is about 20-40 bits, and it is
repetitively inserted into the audio signal.
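
The repetitive, frame-by-frame insertion can be expressed as a modular index. The short Python sketch below illustrates only that mapping; the name `bit_for_frame` and the 4-bit pattern are hypothetical (the text suggests patterns of about 20-40 bits).

```python
def bit_for_frame(pattern, frame_index):
    """Return the pattern bit carried by the frame at frame_index (0-based)."""
    return pattern[frame_index % len(pattern)]


pattern = [1, 0, 0, 1]   # hypothetical short pattern for illustration
bits = [bit_for_frame(pattern, k) for k in range(10)]
print(bits)   # [1, 0, 0, 1, 1, 0, 0, 1, 1, 0]
```
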
<Step 2>: Step S120 

Evaluation of the characteristic value of the frame 

The characteristic value F of one frame (F_i) is
evaluated. In order to evaluate the characteristic value F,
first the sums of the squares of the audio signal in range A
and in range B are obtained as in the following [Equation 1].

[Equation 1]

$$S_A = \sum_{t=i-1}^{i-1/2} s^2(t), \qquad S_B = \sum_{t=i-1/2}^{i} s^2(t)$$

Herein, i-1, i-1/2 and i respectively denote the
starting point of range A, the ending point of range A (or
the starting point of range B), and the ending point of
range B. Also, s(t) denotes the filtered audio signal.
[Equation 1] is an arbitrary definition, and it can also be
defined differently, as described later.

Next, the characteristic value F of the audio signal
can be obtained as in the following [Equation 2] using S_A and
S_B.

[Equation 2]

$$F = \frac{S_A - S_B}{S_A + S_B}$$
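
A minimal Python sketch of this step follows. It reads [Equation 2] as the normalized difference (S_A - S_B) / (S_A + S_B), which is the form implied by the later derivation and by the quantization levels lying between -1 and 1. The function name and the convention of returning 0.0 for an all-zero frame are assumptions of the sketch.

```python
def characteristic_value(range_a, range_b):
    """Compute F from the energies of the two halves of a frame."""
    s_a = sum(x * x for x in range_a)   # S_A: sum of squares in range A
    s_b = sum(x * x for x in range_b)   # S_B: sum of squares in range B
    total = s_a + s_b
    if total == 0:
        return 0.0   # all-zero frame: F is undefined, 0.0 is a sketch convention
    return (s_a - s_b) / total


print(characteristic_value([3.0], [1.0]))   # (9 - 1) / (9 + 1) = 0.8
```

With this definition F is +1 when all the energy lies in range A, -1 when it all lies in range B, and 0 when the halves are balanced.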

The embedment of the watermark in accordance with the 
present invention is performed by modifying the original 
audio signal s(t) so that the characteristic value F is 
changed to a quantized characteristic value F' as follows. 

<Step 3>: Step S130 

Determination on the level of quantization of the 
characteristic value F 

In order to embed the pattern information, the
quantization standard value into which the characteristic
value F of the audio signal should be changed is determined.
First, the sets Q_0 and Q_1 of quantization standard values are
defined as in the following [Equation 3].

[Equation 3]

$$Q_0 = [-0.7,\ -0.3,\ 0.1,\ 0.5,\ 0.9]$$
$$Q_1 = [-0.9,\ -0.5,\ -0.1,\ 0.3,\ 0.7]$$

If the bit of the pattern information to be inserted
into the frame is "0", the characteristic value F obtained
from [Equation 2] is quantized to the closest element of set
Q_0 of [Equation 3]. If the bit is "1", the characteristic
value F is quantized to the closest element of set Q_1.

For example, if the characteristic value F obtained
from [Equation 2] is 0.15, the value closest to 0.15 in Q_0 is
0.1 and the value closest to 0.15 in Q_1 is 0.3. Therefore,
if the bit of the pattern information to be inserted into
the frame (F_i) is "0", the quantized value Q of the
characteristic value F is 0.1, and if the bit is "1", the
quantized value Q is 0.3. The original audio signal s(t)
should be modified so that its characteristic value becomes
the same as the quantized value Q determined as above. This
modification is carried out in the following step 4, which
comprises obtaining the intensity of insertion g and
modifying the original audio signal s(t) according to the
intensity of insertion g.
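
The worked example above can be reproduced with a few lines of Python. The sets are the ones listed in [Equation 3]; the function name `quantize` is an assumption of this sketch.

```python
Q0 = [-0.7, -0.3, 0.1, 0.5, 0.9]   # levels used when the bit is "0"
Q1 = [-0.9, -0.5, -0.1, 0.3, 0.7]  # levels used when the bit is "1"


def quantize(f, bit):
    """Return the level of the set selected by bit that is closest to f."""
    levels = Q1 if bit else Q0
    return min(levels, key=lambda q: abs(q - f))


print(quantize(0.15, 0))   # 0.1
print(quantize(0.15, 1))   # 0.3
```
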

<Step 4>: Step S140

Evaluation of intensity of insertion g

In order to quantize the audio characteristic value F,
the original audio signal s(t) is modified as in the
following [Equation 4].

[Equation 4]

$$\text{Range A:}\quad s'(t) = s(t) + g \cdot s(t)$$
$$\text{Range B:}\quad s'(t) = s(t) - g \cdot s(t)$$

Herein, s'(t) is the audio signal modified so as to
have the quantized characteristic value F', and g is the
intensity of insertion, which scales the component added to
the original audio signal s(t) so that the audio signal s'(t)
after modification has the quantized characteristic value F'.

[Equation 4] means that, when the original audio signal
s(t) is modified so that the characteristic value F obtained
from [Equation 1] and [Equation 2] becomes the quantized
characteristic value F', a copy of the original audio signal
s(t) scaled by the intensity of insertion g is added to the
original audio signal s(t) in range A within the frame (F_i),
and is subtracted from the original audio signal s(t) in
range B.

The intensity of insertion g is obtained by the
following mathematical process.

The characteristic value F' of the modified audio
signal s'(t) and the determined quantized value Q are
identical, and thus they satisfy the following [Equation 5].

[Equation 5]

$$F' = Q$$

To select the intensity of insertion g satisfying
[Equation 5], [Equation 1] is inserted into [Equation 2], and
s(t) in [Equation 2] is substituted with s'(t) of
[Equation 4]. Thus, the following [Equation 6] is obtained.

[Equation 6]

$$Q = F' = \frac{S'_A - S'_B}{S'_A + S'_B} = \frac{\sum (1+g)^2 s_1^2(t) - \sum (1-g)^2 s_2^2(t)}{\sum (1+g)^2 s_1^2(t) + \sum (1-g)^2 s_2^2(t)}$$

Herein, s_1(t) and s_2(t) represent the audio signal s(t) in
range A and range B, respectively, and S'_A and S'_B are the
values obtained from [Equation 1] for the audio signal s'(t)
after modification.

If the rightmost term of [Equation 6] is expanded, the
terms including g^2 may be omitted because the intensity of
insertion is sufficiently small, and thus the equation
reduces to the following [Equation 7].

[Equation 7]

$$Q = \frac{\sum s_1^2(t) - \sum s_2^2(t) + 2g\left(\sum s_1^2(t) + \sum s_2^2(t)\right)}{\sum s_1^2(t) + \sum s_2^2(t) + 2g\left(\sum s_1^2(t) - \sum s_2^2(t)\right)}$$

Herein, if α and β are defined as in the following
[Equation 8],

[Equation 8]

$$\alpha = \sum s_1^2(t) - \sum s_2^2(t), \qquad \beta = \sum s_1^2(t) + \sum s_2^2(t)$$

then the following [Equation 9] is established from
[Equation 2].

[Equation 9]

$$F = \frac{\alpha}{\beta}$$

Accordingly, [Equation 7] is rewritten as the following
[Equation 10], and solving [Equation 10] for g gives the
intensity of insertion as in [Equation 11].

[Equation 10]

$$Q = \frac{\alpha + 2g\beta}{\beta + 2g\alpha} = \frac{F + 2g}{1 + 2gF}$$

[Equation 11]

$$g = \frac{Q - F}{2\,(1 - QF)}$$

In this way, the intensity of insertion g is obtained,
i.e., the value used to derive the audio signal s'(t) by
modifying the original audio signal s(t) so that the
characteristic value F of the current audio signal s(t)
becomes a characteristic value F' identical to the quantized
value Q.

<Step 5>: Step S150

Embedment of watermark

When the intensity of insertion g has been obtained from
[Equation 11], [Equation 4] is applied to the original audio
signal s(t), and thus the modified audio signal s'(t) is
obtained. The modified audio signal s'(t) has the quantized
characteristic value F'.
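
Steps 4 and 5 can be checked numerically with the Python sketch below. It assumes the first-order solution g = (Q - F) / (2(1 - QF)), obtained by dropping the g^2 terms as the text describes, and it assumes the normalized-difference form of the characteristic value. The function names and the sample values are hypothetical. Because of the dropped terms, the resulting F' only approximates Q, and the error grows with |g|.

```python
def characteristic_value(range_a, range_b):
    """F = (S_A - S_B) / (S_A + S_B), with sums of squares."""
    s_a = sum(x * x for x in range_a)
    s_b = sum(x * x for x in range_b)
    return (s_a - s_b) / (s_a + s_b)


def insertion_intensity(f, q):
    """First-order solution for g (terms in g^2 are neglected)."""
    return (q - f) / (2.0 * (1.0 - q * f))


def embed(range_a, range_b, q):
    """Scale range A by (1 + g) and range B by (1 - g)."""
    g = insertion_intensity(characteristic_value(range_a, range_b), q)
    new_a = [x * (1.0 + g) for x in range_a]
    new_b = [x * (1.0 - g) for x in range_b]
    return new_a, new_b, g


# Hypothetical filtered samples of one frame; F is about 0.108 here.
range_a = [0.5, -0.4, 0.3, 0.2]
range_b = [0.4, -0.3, 0.35, -0.25]

new_a, new_b, g = embed(range_a, range_b, q=0.1)
f_after = characteristic_value(new_a, new_b)
print(g, f_after)   # g is small and negative; f_after is close to 0.1
```

If a tighter match were needed, the same update could be applied a second time to the modified frame; that refinement is a suggestion of this sketch and is not described in the text.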

By obtaining the modified audio signal s'(t), the step
for embedding the watermark according to the present
invention is completed. The audio signal s'(t) obtained
thereby is recorded in the storage media 150 (S160). A
separate process for compressing the audio data may be
carried out before recording.

The signal used for obtaining the audio characteristic
value F at the start of the above procedure may be only a
part of the frequency range of the audio signal. That is,
instead of the full audio signal, only a particular band of
1 kHz may be used. Accordingly, if the frequency band that
was used is not disclosed, it becomes very difficult to
identify or search for the watermark. Also, [Equation 1] used
for obtaining the audio characteristic value F and
[Equation 3] defining the level of quantization can be
modified in various ways, and such modification can increase
the security of the watermarking. Further, by modifying
several such parameters, the number of possible methods for
embedding a watermark can be increased without limit.

For instance, the above equations can be modified as
below.

[Equation 1] can be substituted with the following
[Equation 12].

[Equation 12]

$$S_A = \sum_{t=i-1}^{i-1/2} |s(t)|, \qquad S_B = \sum_{t=i-1/2}^{i} |s(t)|$$

The values S_A and S_B used in the prior step for obtaining
the characteristic value F in [Equation 1] are obtained by
adding the squares of the audio signal in range A and
range B, respectively. However, the values S_A and S_B in
[Equation 12] are obtained by adding the absolute values of
the audio signal in range A and range B, respectively.

In this case, to select the intensity of insertion g
satisfying [Equation 5], [Equation 12] is inserted into
[Equation 2], and s(t) in [Equation 2] is substituted with
s'(t) of [Equation 4]. After rearranging, a procedure similar
to the aforementioned step is carried out, and as a result
the intensity of insertion g is obtained as in the following
[Equation 13].

[Equation 13]

$$g = \frac{Q - F}{1 - QF}$$

Moreover, the standard values Q_0 and Q_1 of quantization
can be modified, for instance, as in the following
[Equation 14].

[Equation 14]

$$Q_0 = [-0.75,\ 0.25]$$
$$Q_1 = [-0.25,\ 0.75]$$

Such modifications are exemplary, and various
modifications can be made according to the content or the
object of the design. If information on such modifications
is not exposed, it will be difficult for unauthorized
outside attackers to extract the information embedded in
copyrighted works. Accordingly, the security of the algorithm
can be strengthened.
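
The absolute-value variant of [Equation 12] can be illustrated with the Python sketch below. With sums of absolute values, scaling range A by (1 + g) and range B by (1 - g) scales S_A and S_B by the same factors (for |g| < 1), so solving F' = Q gives g = (Q - F) / (1 - QF) without any approximation. This closed form is derived here from [Equation 2], [Equation 4] and [Equation 12] and is an assumption about what [Equation 13] contains. The function names and sample values are hypothetical, and the level 0.25 is taken from set Q_0 of [Equation 14].

```python
def characteristic_value_abs(range_a, range_b):
    """F computed from sums of absolute values, as in [Equation 12]."""
    s_a = sum(abs(x) for x in range_a)
    s_b = sum(abs(x) for x in range_b)
    total = s_a + s_b
    return 0.0 if total == 0 else (s_a - s_b) / total


def insertion_intensity_abs(f, q):
    """Exact g for the absolute-value variant (valid for |g| < 1)."""
    return (q - f) / (1.0 - q * f)


range_a = [0.5, -0.4]   # S_A = 0.9
range_b = [0.3, -0.2]   # S_B = 0.5, so F = 0.4 / 1.4
f = characteristic_value_abs(range_a, range_b)
g = insertion_intensity_abs(f, q=0.25)
new_a = [x * (1.0 + g) for x in range_a]
new_b = [x * (1.0 - g) for x in range_b]
print(characteristic_value_abs(new_a, new_b))   # 0.25 up to rounding error
```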

During the process of embedding a watermark, silence
at the beginning of music is one of the factors that should
be taken into consideration. Since silence has very weak
signal strength, and information embedded in it is difficult
to extract, silent portions are not used, and it is
preferable to embed information starting from the part where
the audio signal begins.

Generally, most audio begins with silence lasting from
one second to several seconds. Research on identifying such
silence has been actively carried out in the field of audio
signal analysis. Histograms, energy functions, the SVF
(spectral variation function), etc. are commonly used, and
technology for identifying silence is also used in
particular for analyzing the syllables or phonemes of audio
signals.
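
One simple way to locate the end of the leading silence is a short-term energy test, one of the approaches named above. The Python sketch below is only an illustration: the function name, the block length, and the energy threshold are assumptions, and the text does not specify which criterion the invention actually uses.

```python
def first_active_sample(samples, block_len, energy_threshold):
    """Return the index of the first block whose mean energy exceeds
    the threshold, or None if every block counts as silence."""
    for start in range(0, len(samples) - block_len + 1, block_len):
        block = samples[start:start + block_len]
        energy = sum(x * x for x in block) / block_len   # mean square
        if energy > energy_threshold:
            return start
    return None


# Eight silent samples followed by eight samples of amplitude 0.5.
signal = [0.0] * 8 + [0.5] * 8
print(first_active_sample(signal, block_len=4, energy_threshold=0.01))   # 8
```

Embedding and detection would both start framing from the returned index, which is why the same criterion should be used on both sides.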

Herein, silence refers to sound inaudible to the human
ear. That is, even noise is treated as meaningful sound if
it is very loud. The reasons for greatly simplifying the
steps for identifying silence are, first, the restriction on
the time available for identifying silence; second, the need
for a simple and accurate method that maximizes the
credibility of the identification of silence; and third, the
fact that the signal segmentation used for audio signals is
rarely applied to music.

Along with the silence identifying procedure, another
procedure which should precede the detection of the watermark
is synchronization. For synchronization, it is enough to
align the position of the frames into which information is
embedded within an error of 5-10%, unlike a spread spectrum
method, which does not tolerate an error of even one or two
samples.

Accordingly, whether synchronization is maintained
should be checked at the start of watermark detection and
during the detection process. As errors may occur during the
aforementioned silence identifying process, synchronization
must be confirmed first, and 2 to 3 additional confirmations
of synchronization should be made during the process of
detecting information, so as to prevent the propagation of
errors which may occur when synchronization is lost.

A signal of 16 to 20 bits is embedded as a
synchronizing signal in the same manner as the watermark. To
detect the synchronizing signal, detection of a 16-bit
synchronizing signal is repeated while shifting the starting
position by 3-5% of the frame length each time, which yields
a graph of the correlation of the synchronizing signal. The
center of the area having the highest correlation is taken
as the matching point of the synchronizing signal. According
to the result of detecting the synchronizing signal while
shifting by 3% of the frame length, a high correlation with
a synchronization error within 15% can be obtained.
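
The rule of taking the center of the highest-correlation area can be sketched as below. The Python function assumes that a correlation score has already been measured at each trial offset, for example as the number of synchronizing bits that match; how the scores are produced, the function name, and the sample numbers are all assumptions of this sketch.

```python
def sync_position(offsets, scores):
    """Return the center of the first run of offsets sharing the top score."""
    peak = max(scores)
    first = scores.index(peak)          # start of the plateau
    last = first
    while last + 1 < len(scores) and scores[last + 1] == peak:
        last += 1                       # extend to the end of the plateau
    return (offsets[first] + offsets[last]) / 2


# Hypothetical scan: 800-sample frames, trial offsets in steps of 24
# samples (3% of the frame), score = matching bits out of 16.
offsets = [0, 24, 48, 72, 96, 120]
scores = [9, 12, 16, 16, 16, 11]
print(sync_position(offsets, scores))   # 72.0
```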

In the above embodiment, an example is described
wherein a single bit is inserted into each frame of the
audio signal. However, a plurality of bits of information
may be inserted into one frame. When a large amount of
information must be inserted into a short audio signal, 2 to
3 bits may be inserted into a single frame by extracting
several band signals in the filtering step and inserting bit
information into each band signal. Of course, in this case,
the bands must be chosen so that the filtered signals do not
interfere with each other, and more caution is required when
determining the intensity of insertion.

In order to increase credibility when verifying
alteration/forgery after the watermark is detected, an error
detecting code or error correcting code for the pattern
information may be inserted together with the pattern
information. For instance, if one bit of information is
inserted into each frame, 88 bits including 16 bits of CRC
are inserted in order to insert 72 bits of information. If
the 88 bits are encoded with a turbo code, 270 bits are
generated, and thus for 72 bits of information, 270 bits,
more than three times as many, are inserted.

In the experiment of the present invention, with a
frame size of 80 ms, the 270 bits of information are inserted
into about 25 seconds of the audio signal. If the audio
plays for 3 minutes, the information is inserted repeatedly
about 7 times.

The reason for inserting a CRC (Cyclic Redundancy Code)
together with the actual information is to verify that the
detected information is identical to the inserted
information. If incorrect information were detected and
accepted as a correct value, it could be a fatal weakness
for the entire system.

The process for extracting the inserted pattern information
is very similar to the process for inserting it.

Fig. 4 illustrates the steps for identifying
alteration/forgery according to the process for extracting
pattern information.

To extract the information inserted into an arbitrary audio
signal, as in the process for embedding, the lengths of the
silence and noise signals are identified and these portions
are removed, and filtering is performed, starting from the
part where the actual sound signal begins, with the band
filter used in the process for embedding (S210). The silence
identifying process and/or the filtering process can be
omitted, as at the time of embedding the watermark. However,
if silence identification and/or filtering is performed when
embedding the watermark, it is preferable to perform it also
when extracting the watermark. Further, in order to improve
reliability at the time of detecting the watermark, the
identification of silence should be performed in the same
manner as for embedding the watermark, and the filtering
frequency should be the same as that used for embedding the
watermark.

The next step is to evaluate the characteristic value F of
the filtered audio signal (S220). The characteristic value F
is evaluated in the same manner as for embedding the
watermark. That is, if the characteristic value F is
evaluated according to [Equation 2] at the time of embedding
the watermark, it should likewise be evaluated according to
[Equation 2] at the time of extracting the watermark.

After evaluating the characteristic value F, the
characteristic value F is compared with each of the
quantized values in the set Q of quantized values. Here, the
set Q of quantized values should also be the same as that
used at the time of embedding the watermark.

As a result of the comparison, the quantized value closest
to the characteristic value F is determined. One method for
doing so is to determine which element of the set of
quantized values the characteristic value is closest to and
to express the degree of closeness as a value in the range
-1.0 to 1.0. For example, if F is 0.15, F lies near 0.1,
which is an element of set Q0, and 0.3, which is an element
of set Q1. Fig. 5 illustrates this relation.

As illustrated in Fig. 5, 0.1, the value closest to the
characteristic value F in set Q0, and 0.3, the value closest
to the characteristic value F in set Q1, correspond to -1.0
and 1.0, respectively. Then 0.15, the current characteristic
value F, corresponds to -0.5. A value of -0.5 means that the
current characteristic value is closer to Q0 than to Q1,
which means that the corresponding bit value of the embedded
pattern information is "0". Conversely, it can be inferred
that, during the process of embedding the watermark, the
processes of quantization, evaluation of the intensity of
insertion, and modification of the audio signal were carried
out so as to embed a bit value of "0" at the corresponding
position of the bit string.
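
The mapping in this example can be sketched as below. This
is an illustrative assumption rather than the stated formula
of the invention: the text gives only the endpoints (-1.0 at
the Q0 value, 1.0 at the Q1 value) and the result -0.5 for
F = 0.15, so a linear interpolation between the nearest
element of each set is assumed. The function names, and the
elements of the example sets other than 0.1 and 0.3, are
hypothetical.

```python
def soft_value(f, q0_set, q1_set):
    """Map f to [-1.0, 1.0]: -1.0 at a Q0 value, 1.0 at a Q1 value."""
    d0 = min(abs(f - q) for q in q0_set)  # distance to nearest "0" level
    d1 = min(abs(f - q) for q in q1_set)  # distance to nearest "1" level
    return (d0 - d1) / (d0 + d1)          # assumes Q0 and Q1 share no element


def decide_bit(f, q0_set, q1_set):
    """A negative soft value is read as bit 0, otherwise as bit 1."""
    return 0 if soft_value(f, q0_set, q1_set) < 0 else 1


Q0 = [0.1, 0.5, 0.9]   # hypothetical levels for bit "0"
Q1 = [0.3, 0.7]        # hypothetical levels for bit "1"

print(soft_value(0.15, Q0, Q1))  # approximately -0.5
print(decide_bit(0.15, Q0, Q1))  # 0
```

The magnitude of the soft value indicates how close F is to
a quantized level, which could also serve as a confidence
input to a soft-decision decoder.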

By performing these processes for every frame, the bit
string of the pattern information embedded into each frame
can be obtained in sequence (S230).

As mentioned above, if the embedded pattern information is
72 bits, a total of 270 bits of information, to which the
CRC code and turbo code have been added, are embedded. Hence,
after obtaining the 270 bits of information in sequence as
above, the embedded information, namely the 72 bits and the
16 CRC bits, can be obtained by decoding the turbo code.

Finally, decoding is performed for the error correcting
code and/or error detecting code inserted together with the
pattern information. If a CRC was inserted as an error
detecting code at the time of embedding the watermark, as
mentioned above, the extracted watermark information is
examined through a CRC check as to whether it is consistent
with the actually embedded information, and thus whether the
watermark has been altered/forged is identified (S240). If,
as a result of the CRC check, the information is consistent,
the embedded information is output. Otherwise, the word
"NONE" is output and watermark detection continues. In the
present invention, all of the embedded watermark information
is intended to be extracted. In an actual system, however,
the extraction process ends once the watermark is detected.
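
The CRC check and the "NONE" output described above can be
sketched as follows. This is a minimal illustration under
stated assumptions: each candidate is an 88-bit block (72
information bits followed by 16 CRC bits) already recovered
by turbo decoding, which is not shown; the CRC is assumed to
be CRC-16/CCITT as computed by Python's binascii.crc_hqx,
since the text does not name a polynomial; and the function
names are hypothetical. The loop stops at the first block
that passes the check.

```python
import binascii


def crc_ok(block: bytes) -> bool:
    """Check an 11-byte block: 9 payload bytes followed by a 2-byte CRC."""
    payload, received = block[:9], int.from_bytes(block[9:], "big")
    return len(block) == 11 and binascii.crc_hqx(payload, 0xFFFF) == received


def detect(blocks):
    """Return (payload, outputs); outputs holds "NONE" for each failed block."""
    outputs = []
    for block in blocks:
        if crc_ok(block):
            outputs.append(block[:9])
            return block[:9], outputs   # stop once a watermark is detected
        outputs.append("NONE")          # keep detecting on the next block
    return None, outputs


good = b"WATERMARK" + binascii.crc_hqx(b"WATERMARK", 0xFFFF).to_bytes(2, "big")
bad = bytes([good[0] ^ 0x01]) + good[1:]   # same block with one bit flipped
payload, outputs = detect([bad, good, good])
```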

If the extracted pattern information is consistent with the
embedded pattern information, it can be determined that no
modification such as alteration/forgery has been made to the
audio signals. Here, the two need not be completely
identical: if the degree of consistency exceeds a certain
threshold, for example 80%, the information is considered
consistent. If the degree of consistency is 80% or below, it
can be considered that the original audio signals have been
altered/forged.
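
The threshold test can be sketched as below. The text does
not define how the degree of consistency is computed, so
this sketch assumes it is the fraction of bit positions at
which the extracted and embedded bit strings agree; the
function names are hypothetical, and 80% is the example
threshold from the text.

```python
THRESHOLD = 0.80  # example threshold from the text


def consistency(extracted, embedded):
    """Fraction of bit positions at which the two bit strings agree."""
    if len(extracted) != len(embedded) or not embedded:
        raise ValueError("bit strings must be non-empty and equal in length")
    matches = sum(1 for a, b in zip(extracted, embedded) if a == b)
    return matches / len(embedded)


def is_authentic(extracted, embedded, threshold=THRESHOLD):
    """True only if the degree of consistency exceeds the threshold."""
    return consistency(extracted, embedded) > threshold


embedded = [1, 0, 1, 1, 0, 0, 1, 0, 1, 1]
one_error = [0] + embedded[1:]        # 9 of 10 bits agree
two_errors = [0, 1] + embedded[2:]    # 8 of 10 bits agree
```

With these inputs, one_error is accepted, while two_errors
sits exactly at 80% and is therefore treated as
altered/forged.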

Further, as discussed above as one of the modifications at
the time of embedding the watermark, the signal to be
watermarked may be filtered through a plurality of frequency
ranges, each with a different band. If pattern information
is embedded in each filtered frequency range such that a
plurality of bits is inserted into a single frame, each bit
of the pattern information should also be extracted at the
time of extracting the watermark. To do so, before the
characteristic value is evaluated, the signal is filtered
through each of a plurality of frequency ranges identical to
the filtering frequencies used at the time of embedding the
watermark, and the processes of evaluation of the
characteristic value (S220), extraction of the pattern
(S230), identification of alteration/forgery (S240), etc.
should be performed for each filtered signal.

Although the preferred embodiments of the present invention
are illustrated and described above, the scope of the
present invention is not limited to the aforementioned
particular embodiments, and a person skilled in the
pertinent art can make various modifications within a scope
that does not deviate from the spirit of the present
invention.






